A binary PSO approach to mine high-utility itemsets

نویسندگان

  • Chun-Wei Lin
  • Lu Yang
  • Philippe Fournier-Viger
  • Tzung-Pei Hong
  • Miroslav Voznak
چکیده

High-utility itemset mining (HUIM) is a critical issue in recent years since it can be used to reveal the profitable products by considering both the quantity and profit factors instead of frequent itemset mining (FIM) or association-rule mining (ARM). Several algorithms have been presented tomine high-utility itemsets (HUIs) andmost of them have to handle the exponential search space for discoveringHUIswhen the number of distinct items and the size of database are very large. In the past, a heuristic HUPEumuGRAM algorithm was proposed to mine HUIs based on genetic algorithm (GA). For the evolutionary computation Communicated by V. Loia. B Jerry Chun-Wei Lin [email protected] Lu Yang [email protected] Philippe Fournier-Viger [email protected] Tzung-Pei Hong [email protected] Miroslav Voznak [email protected] 1 School of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, China 2 School of Natural Sciences and Humanities, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, China 3 Department of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung, Taiwan, ROC 4 Department of Computer Science and Engineering, National Sun Yat-sen University, Kaohsiung, Taiwan, ROC 5 Department of Telecommunications, Faculty of Electrical Engineering and Computer Science, VSB Technical University of Ostrava, Ostrava-Poruba, Czech Republic (EC) techniques of particle swarm optimization (PSO), it only requires fewer parameters compared to the GA-based approaches. Since the traditional PSO mechanism is used to handle the continuous problem, in this paper, the discrete PSO is adopted to encode the particles as the binary variables. An efficient PSO-based algorithm, namely HUIMBPSO, is proposed to efficiently find HUIs. The designed HUIM-BPSO algorithm finds the high-transaction-weighted utilization 1-itemsets (1-HTWUIs) as the size of the particles based on transaction-weighted utility (TWU) model, which can greatly reduce the combinational problem in evolution process. The sigmoid function is adopted in the updating process of the particles for the designed HUIM-BPSO algorithm. An OR/NOR-tree structure is further developed to reduce the invalid combinations for discovering HUIs. Substantial experiments on real-life datasets show that the proposed algorithm outperforms the other heuristic algorithms for mining HUIs in terms of execution time, number of discovered HUIs, and convergence.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mining high-utility itemsets based on particle swarm optimization

High-utility itemset mining (HUIM) is a critical issue in recent years since it can be used to reveal the profitable products by considering both the quantity and profit factors instead of frequent itemset mining (FIM) or association-rule mining (ARM). Several algorithms have been presented to mine high-utility itemsets (HUIs) and most of the designed algorithms have to handle the exponential s...

متن کامل

A New Algorithm for High Average-utility Itemset Mining

High utility itemset mining (HUIM) is a new emerging field in data mining which has gained growing interest due to its various applications. The goal of this problem is to discover all itemsets whose utility exceeds minimum threshold. The basic HUIM problem does not consider length of itemsets in its utility measurement and utility values tend to become higher for itemsets containing more items...

متن کامل

A Novel Algorithm for Mining Fuzzy High Utility Itemsets

Utility mining is to find the itemsets in a transaction database with high utility values like profits. Although a number of algorithms on high utility mining have been proposed, they did not reflect the fuzzy degree of quantity and profit level for mined high utility itemsets, which are essential for decision making in various applications like stock control and sales analysis. In this paper, ...

متن کامل

Efficient Mining of Temporal High Utility Itemsets from Data streams

Utility itemsets are considered as the different values of individual items as utilities, and utility mining aims at identifying the itemsets with high utilities. The temporal high utility itemsets are the itemsets with support larger than a pre-specified threshold in current time window of data stream. Discovery of temporal high utility itemsets is an important process for mining interesting p...

متن کامل

An efficient algorithm for mining temporal high utility itemsets from data streams

Utility of an itemset is considered as the value of this itemset, and utility mining aims at identifying the itemsets with high utilities. The temporal high utility itemsets are the itemsets whose support is larger than a pre-specified threshold in current time window of the data stream. Discovery of temporal high utility itemsets is an important process for mining interesting patterns like ass...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Soft Comput.

دوره 21  شماره 

صفحات  -

تاریخ انتشار 2017